Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

نویسندگان

Eugene A. Feinberg

Pavlo O. Kasyanov

Michael Z. Zgurovsky

چکیده

Authors are encouraged to submit new papers to INFORMS journals by means of a style file template, which includes the journal title. However, use of a template does not certify that the paper has been accepted for publication in the named journal. INFORMS journal templates are for the exclusive purpose of submitting to an INFORMS journal and should not be used to distribute the papers in print or online or to submit the papers to another publication.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

This paper presents sufficient conditions for the existence of stationary optimal policies for averagecost Markov Decision Processes with Borel state and action sets and with weakly continuous transition probabilities. The one-step cost functions may be unbounded, and action sets may be noncompact. The main contributions of this paper are: (i) general sufficient conditions for the existence of ...

متن کامل

Context-Driven Predictions

Markov models have been a keystone in Artificial Intelligence for many decades. However, they remain unsatisfactory when the environment modelled is partially observable. There are pathological examples where no history of fixed length is sufficient for accurate prediction or decision making. On the other hand, working with a hidden state (like in Hidden Markov Models or Partially Observable Ma...

متن کامل

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies

This paper studies convergence properties of optimal values and actions for discounted and averagecost Markov Decision Processes (MDPs) with weakly continuous transition probabilities and applies these properties to the stochastic periodic-review inventory control problem with backorders, positive setup costs, and convex holding/backordering costs. The following results are established for MDPs...

متن کامل

Processos de Decisão de Markov: um tutorial

There are situations where decisions must be made in sequence, and the result of each decision is not clear to the decision maker. These situations can be formulated mathematically as Markov decision processes, and given the probabilities of each value, it is possible to determine a policy that maximizes the expected outcome of a sequence of decisions. This tutorial explains Markov decision pro...

متن کامل

Dialogue Control Algorithm for Ambient Intelligence based on Partially Observable Markov Decision Processes

From the viewpoint of supporting users’ natural dialogue communication with conversational agents, their dialogue management has to determine any agent’s action, based on probabilistic methods derived from noisy data through sensors in the real world. We believe unique Partially Observable Markov Decision Processes (POMDPs) should be applied to such action control systems. The agents must flexi...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Math. Oper. Res.

دوره 41 شماره

صفحات -

تاریخ انتشار 2016

Partially Observable Total-Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

نویسندگان

چکیده

منابع مشابه

Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities

Context-Driven Predictions

On the Convergence of Optimal Actions for Markov Decision Processes and the Optimality of (s, S) Inventory Policies

Processos de Decisão de Markov: um tutorial

Dialogue Control Algorithm for Ambient Intelligence based on Partially Observable Markov Decision Processes

عنوان ژورنال:

اشتراک گذاری